RAW SEQUENCE LISTING 
ERROR REPORT 



The Biotechnology Systems Branch of the Scientific and Technical Information 
Center (STIC) detected errors when processing the following computer readable 
form: 

Application Serial Number: 10/690,359

Source: OIPE

Date Processed by STIC: 10/30/03

THE ATTACHED PRINTOUT EXPLAINS DETECTED ERRORS.

PLEASE FORWARD THIS INFORMATION TO THE APPLICANT BY EITHER: 

1) INCLUDING A COPY OF THIS PRINTOUT IN YOUR NEXT COMMUNICATION TO THE 
APPLICANT, WITH A NOTICE TO COMPLY or, 

2) TELEPHONING APPLICANT AND FAXING A COPY OF THIS PRINTOUT, WITH A 
NOTICE TO COMPLY 

FOR CRF SUBMISSION AND PATENTIN SOFTWARE QUESTIONS, PLEASE CONTACT 
MARK SPENCER, TELEPHONE: 703-308-4212; FAX: 703-308^221 
Effective 12/13/03 : TELEPHONE: 571-272-2510; FAX: 571-273-0221 



TO REDUCE ERRORED SEQUENCE LISTINGS, PLEASE USE THE CHECKER
VERSION 4.1 PROGRAM, ACCESSIBLE THROUGH THE U.S. PATENT AND
TRADEMARK OFFICE WEBSITE. SEE BELOW FOR ADDRESS:

http://www.uspto.gov/web/offices/pac/checker/chkr41note.htm



Applicants submitting genetic sequence information electronically on diskette or CD-Rom should be aware that there
is a possibility that the disk/CD-Rom may have been affected by treatment given to all incoming mail.

Please consider using alternate methods of submission for the disk/CD-Rom or replacement disk/CD-Rom.

Any reply including a sequence listing in electronic form should NOT be sent to the 20231 zip code address for the
United States Patent and Trademark Office, and instead should be sent via the following to the indicated addresses:

1. EFS-Bio (<http://www.uspto.gov/ebc/efs/downloads/documents.htm>, EFS Submission
User Manual - ePAVE)

2. U.S. Postal Service: Commissioner for Patents, P.O. Box 1450, Alexandria, VA 22313-1450

3. Hand Carry directly to (EFFECTIVE 12/01/03): 

U.S. Patent and Trademark Office, Box Sequence, Customer Window, Lobby, Room 1B03, Crystal Plaza Two,
2011 South Clark Place, Arlington, VA 22202

4. Federal Express, United Parcel Service, or other delivery service to: U.S. Patent and Trademark Office,
Box Sequence, Room 1B03-Mailroom, Crystal Plaza Two, 2011 South Clark Place, Arlington, VA 22202







Revised 10/08/03 



Raw Sequence Listing Error Summary



ERROR DETECTED SUGGESTED CORRECTION SERIAL NUMBER 

ATTN: NEW RULES CASES: PLEASE DISREGARD ENGLISH "ALPHA" HEADERS, WHICH WERE INSERTED BY PTO SOFTWARE



 1  _ Wrapped Nucleics
    _ Wrapped Aminos
        The number/text at the end of each line "wrapped" down to the next line. This may occur if your
        file was retrieved in a word processor after creating it. Please adjust your right margin to .3;
        this will prevent "wrapping."

 2  _ Invalid Line Length
        The rules require that a line not exceed 72 characters in length; this includes white spaces.

 3  _ Misaligned Amino Numbering
        The numbering under each 5th amino acid is misaligned. Do not use tab codes between numbers;
        use space characters instead.

 4  _ Non-ASCII
        The submitted file was not saved in ASCII (DOS) text, as required by the Sequence Rules. Please
        ensure your subsequent submission is saved in ASCII text.

 5  _ Variable Length
        Sequence(s) ____ contain n's or Xaa's representing more than one residue. Per Sequence Rules,
        each n or Xaa can only represent a single residue. Please present the maximum number of each
        residue having variable length and indicate in the <220>-<223> section that some may be missing.

 6  _ Patentin 2.0 "bug"
        A "bug" in Patentin version 2.0 has caused the <220>-<223> section to be missing from amino acid
        sequence(s) ____. Normally, Patentin would automatically generate this section from the
        previously coded nucleic acid sequence. Please manually copy the relevant <220>-<223> section to
        the subsequent amino acid sequence. This applies to the mandatory <220>-<223> sections for
        Artificial or Unknown sequences.

 7  _ Skipped Sequences (OLD RULES)
        Sequence(s) ____ missing. If intentional, please insert the following lines for each skipped
        sequence:

        (2) INFORMATION FOR SEQ ID NO:X: (insert SEQ ID NO where "X" is shown)
        (i) SEQUENCE CHARACTERISTICS: (Do not insert any subheadings under this heading)
        (xi) SEQUENCE DESCRIPTION: SEQ ID NO:X: (insert SEQ ID NO where "X" is shown)
        This sequence is intentionally skipped

        Please also adjust the "(iii) NUMBER OF SEQUENCES:" response to include the skipped sequences.

 8  _ Skipped Sequences (NEW RULES)
        Sequence(s) ____ missing. If intentional, please insert the following lines for each skipped
        sequence:

        <210> sequence id number
        <400> sequence id number
        000

 9  _ Use of n's or Xaa's (NEW RULES)
        Use of n's and/or Xaa's has been detected in the Sequence Listing.
        Per 1.823 of Sequence Rules, use of <220>-<223> is MANDATORY if n's or Xaa's are present.
        In the <220> to <223> section, please explain the location of n or Xaa, and which residue n or
        Xaa represents.

10  X Invalid <213> Response
        Per 1.823 of Sequence Rules, the only valid <213> responses are: Unknown, Artificial Sequence, or
        scientific name (Genus/species). A <220>-<223> section is required when the <213> response is
        Unknown or is Artificial Sequence.

11  _ Use of <220>
        Sequence(s) ____ missing the <220> "Feature" and associated numeric identifiers and responses.
        Use of <220> to <223> is MANDATORY if <213> "Organism" response is "Artificial Sequence" or
        "Unknown." Please explain source of genetic material in <220> to <223> section.
        (See "Federal Register," 06/01/1998, Vol. 63, No. 104, pp. 29631-32) (Sec. 1.823 of Sequence Rules)

12  _ Patentin 2.0 "bug"
        Please do not use the "Copy to Disk" function of Patentin version 2.0. This causes a corrupted
        file, resulting in missing mandatory numeric identifiers and responses (as indicated on raw
        sequence listing). Instead, please use "File Manager" or any other manual means to copy the file
        to floppy disk.

13  _ Misuse of n
        n can only be used to represent a single nucleotide in a nucleic acid sequence. N is not used to
        represent any value not specifically a nucleotide.
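
Several of the format errors in the summary (line length over 72 characters, non-ASCII text, tab codes between numbers) can be caught mechanically before a listing is submitted. The following is a minimal Python sketch of such a pre-check, not the Office's Checker program. The function name `check_format` and the message strings are hypothetical, and the sketch assumes the listing file has been read as raw bytes.

```python
MAX_LINE_LENGTH = 72  # limit stated in the error summary; white space counts


def check_format(raw: bytes) -> list[tuple[int, str]]:
    """Return (line_number, problem) pairs for a sequence listing file.

    Hypothetical helper: covers only the line-length, non-ASCII and
    tab-code items of the error summary.
    """
    problems = []
    for number, line in enumerate(raw.splitlines(), start=1):
        # Any byte above 127 means the file was not saved as plain ASCII text.
        if any(byte > 127 for byte in line):
            problems.append((number, "non-ASCII character"))
        # The 72-character limit includes spaces.
        if len(line) > MAX_LINE_LENGTH:
            problems.append((number, "line exceeds 72 characters"))
        # Tab codes misalign the numbering; spaces should be used instead.
        if b"\t" in line:
            problems.append((number, "tab character; use spaces"))
    return problems
```

A caller would read the file in binary mode, for example `check_format(open(path, "rb").read())`, and report any returned pairs before copying the file to disk.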

AMC/MH - Biotechnology Systems Branch - 08/21/2001 



RAW SEQUENCE LISTING DATE: 10/30/2003 

PATENT APPLICATION: US/10/690,359 TIME: 14:05:59 

Input Set : A:\XTR004 CIP DIV 1.ST25.txt
Output Set: N:\CRF4\10302003\J690359.raw 

3 <110> APPLICANT: GERDES, JOHN C. 

4 MARMARO, JEFFREY 

5 IVES, JEFFREY 

6 ROEHL, CHRISTOPHER 

8 <120> TITLE OF INVENTION: NUCLEIC ACID ARCHIVING 
10 <130> FILE REFERENCE: XTR004 CIP DIV 1 
C--> 12 <140> CURRENT APPLICATION NUMBER: US/10/690,359
C--> 12 <141> CURRENT FILING DATE: 2003-10-21

12 <150> PRIOR APPLICATION NUMBER: 09/944,604 

13 <151> PRIOR FILING DATE: 2001-08-31 

15 <150> PRIOR APPLICATION NUMBER: 09/061,757 

16 <151> PRIOR FILING DATE: 1998-04-16 

18 <150> PRIOR APPLICATION NUMBER: 60/041,999 

19 <151> PRIOR FILING DATE: 1997-04-16 
21 <160> NUMBER OF SEQ ID NOS : 10 

23 <170> SOFTWARE: Patentin version 3.2 

25 <210> SEQ ID NO: 1 

26 <211> LENGTH: 21 

27 <212> TYPE: DNA 

28 <213> ORGANISM: Cryptosporidium parvum 

30 <400> SEQUENCE: 1

31 gaggatagag gcatttggtt g 21 

34 <210> SEQ ID NO: 2 

35 <211> LENGTH: 20 

36 <212> TYPE: DNA 

37 <213> ORGANISM: Cryptosporidium parvum 

39 <400> SEQUENCE: 2 

40 gttttgtagg ggtcgctcat 20 

43 <210> SEQ ID NO: 3 

44 <211> LENGTH: 100 

45 <212> TYPE: DNA 

46 <213> ORGANISM: Cryptosporidium parvum 
48 <400> SEQUENCE: 3

49 ctatatcgta atacgctctg attacgtagg gagtggtact cctaacagta ggcctctgat 60
51 ttgtcagtcg acataccgct gcgctcaaat ccttttagaa 100 

54 <210> SEQ ID NO: 4 

55 <211> LENGTH: 15 

56 <212> TYPE: DNA 

57 <213> ORGANISM: Mycobacterium tuberculosis 

59 <400> SEQUENCE: 4 

60 cgatcgagca agcca 15 

63 <210> SEQ ID NO: 5 

64 <211> LENGTH: 15 




65 <212> TYPE: DNA 

66 <213> ORGANISM: Mycobacterium tuberculosis 

68 <400> SEQUENCE: 5 

69 cgagccgctc gctga 15 

72 <210> SEQ ID NO: 6 

73 <211> LENGTH: 40 

74 <212> TYPE: DNA

75 <213> ORGANISM: (to use with M. Tuberculosis)

77 <400> SEQUENCE: 6

78 accgcatcga atgcatgtct cgggtaaggc gtactcgacc 40

81 <210> SEQ ID NO: 7

82 <211> LENGTH: 40

83 <212> TYPE: DNA

84 <213> ORGANISM: (to use with M. Tuberculosis)

86 <400> SEQUENCE: 7

87 cgattccgct ccagacttct cgggtgtact gagatcccct 40

90 <210> SEQ ID NO: 8 

91 <211> LENGTH: 28 

92 <212> TYPE: DNA 

93 <213> ORGANISM: Homo sapiens 

95 <400> SEQUENCE: 8

96 ataatccacc tatcccagta ggagaaat 28 

99 <210> SEQ ID NO: 9 

100 <211> LENGTH: 28 

101 <212> TYPE: DNA 

102 <213> ORGANISM: Homo sapiens 

104 <400> SEQUENCE: 9 

105 tttggtcctt gtcttatgtc cagaatgc 28 

108 <210> SEQ ID NO: 10 

109 <211> LENGTH: 100 

110 <212> TYPE: DNA 

111 <213> ORGANISM: Homo sapiens 

113 <400> SEQUENCE: 10 

114 atcctatttg ttcctgaagg gtactagtag ttcctgctat gtcacttccc cttggttctc 60 
116 tcatctggcc tggtgcaata ggccctgcat gcactggatg 100 
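
The <213> and <220>-<223> rules in the error summary can be applied to records like those in the listing above. The following is a rough Python sketch under stated assumptions: the record layout (a dict keyed by numeric identifier), the function name `check_record`, and the message strings are all hypothetical, and the scientific-name pattern is only a heuristic, not the test the Office applies.

```python
import re

# Heuristic only: a capitalized genus followed by a lowercase species,
# optionally followed by further text such as a strain designation.
SCIENTIFIC_NAME = re.compile(r"[A-Z][a-z]+ [a-z]+(?: .+)?")
SPECIAL_RESPONSES = ("Unknown", "Artificial Sequence")


def check_record(record):
    """Return a list of problems for one nucleotide record.

    Hypothetical layout: record["213"] is the organism response,
    record["220"] is True when a <220>-<223> section is present, and
    record["sequence"] holds the printed sequence lines.
    """
    problems = []
    organism = record["213"].strip()
    has_feature = record.get("220", False)

    if organism in SPECIAL_RESPONSES:
        # Unknown and Artificial Sequence both require a feature section.
        if not has_feature:
            problems.append("<220>-<223> required for Unknown or Artificial Sequence")
    elif SCIENTIFIC_NAME.fullmatch(organism) is None:
        problems.append("invalid <213> response")

    # Any n in the sequence must be explained in a <220>-<223> section.
    if any("n" in line.lower() for line in record["sequence"]) and not has_feature:
        problems.append("<220>-<223> required when n is present")
    return problems
```

Under these assumptions, a record with the response "Cryptosporidium parvum" passes, while a response of "(to use with M. Tuberculosis)" is flagged as an invalid <213> response.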
VERIFICATION SUMMARY DATE: 10/30/2003

PATENT APPLICATION: US/10/690,359 TIME: 14:06:00

Input Set : A:\XTR004 CIP DIV 1.ST25.txt
Output Set: N:\CRF4\10302003\J690359.raw

L:12 M:270 C: Current Application Number differs. Replaced Current Application No 
L:12 M:271 C: Current Filing Date differs, Replaced Current Filing Date 